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Abstract. In this paper we give a mathematical model for a game that 
we call picture cube puzzle and investigate its properties. The central 
question is the number of moves required to solve the puzzle. A mathe- 
matical discussion is followed by the description of computational results. 
We also give a generalization of the problem for finite groups. 

1 Introduction 

The picture cube puzzle 1 is a puzzle that consists of usually 6, 9 or 12 painted 
wooden cubes that can be arranged in a rectangular pattern to obtain six 
different pictures ( "the solutions" ) seen on the top of the cubes. Each face of 
the cubes contains one piece of one of the six pictures in such a way that the 
position of that piece within the large picture is the same for all six faces. 
Thus, for example, there is a cube whose faces contain the upper left corners 
of the six pictures, another one that contains the lower right corners, etc. 

The cubes are painted so that the solutions can easily be transformed into 
each other: for each picture there are cube rotations whose simultaneous ap- 
plication to all cubes transforms the picture to another one. For example, 
imagine that the puzzle is solved and you can see the picture of the dog on 
top. Pick up the first row of cubes holding them together and rotate the whole 
row around the axis through the centers of the cubes in that row by 90 degrees. 
Do this to all the rows and you obtain the picture of the bear. These row- wise 
or column- wise rotations are our allowable moves for the puzzle, see Figure 1. 
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These puzzles are usually sold in Hungary under the name "mesekocka", meaning fairy 
tale cube. 
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Figure 1: A picture cube puzzle. Rotating each row upwards transforms the 
picture of the numbers into the picture of the letters. We count this as three 
moves (there are three rows to rotate). 



The number of allowable moves is twice the sum of the number of rows and 
the number of columns in the configuration. No move changes the position of 
the cubes within the picture but using the moves we can change which faces 
are on top and the orientation of the top face. A natural question is if we are 
given an arrangement of the cubes where the positions are correct but the 
cubes are arbitrarily rotated in place, can we solve the puzzle? 

For the rest of the paper, we will suppose that one of the solved pictures 
is the solution configuration, and the cubes are somehow individually rotated 
(but kept in place). There are several questions that can be raised about the 
puzzle, we will formalize them in the following section. What are the config- 
urations that can be reached from the solution configuration using allowable 
moves? How many such configurations exist? How many moves are needed to 
reach these configurations in the worst case/on average? We will address these 
questions after introducing a mathematical model using groups for the puzzle. 

These (and other) questions have been raised for more well-known puzzles, 
notably Rubik's Cube [3]. Answering some of them requires the combination 
of non-trivial mathematical ideas with large-scale computer calculations. Di- 
ameters for permutation groups have been investigated e.g. in [1, 2]. 

The present paper is built up as follows. Section 2 gives the mathematical 
model and a generalization for the puzzle. In Sections 3 and 4 we discuss 
solution methods for the cube puzzle. In Section 5, some computational results 
are presented. We give further research directions and a short summary in 
Section 6. 

We refer to any standard textbook, e.g. [4] , for basic facts about groups. 
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2 Mathematical formulation of the puzzle 

In order to formalize the puzzle, we need some notation. Let m and n be 
two positive integers and imagine ran cubes arranged in an m by n matrix 
form in front of us on a table. There is a solution configuration of the cubes 
when a picture can be seen on the top faces that we will refer to as "up" face, 
following standard terminology for Rubik's Cube. We will refer to the other 
faces as down, left, right, front and back faces in the straightforward way. 

There are 2(m + n) moves that allow us to modify the configuration of the 
cubes: up { and downt, (i = 1 , . . . , m) that rotate the cubes in row i along the 
axis through their centers in such a way that their front faces move to the up 
or down positions respectively; and leftj and rightj, (j = 1 , . . . , n) that rotate 
the cubes in column j along the axis through their centers in such a way that 
their up faces move to the left or right positions respectively. We note that 
two consecutive "up" rotations on the same row have the same effect as two 
consecutive "down" rotations on that row (similarly for columns). We count 
these as two moves — using Rubik's Cube terminology, we use the quarter-turn 
metric in the present paper. When these transformations are counted as one 
move, we get the half-turn metric. 

It is a classical observation in many similar puzzles that sequences of moves 
form a group under composition. Allowable moves form a generator set for 
that group and we identify the solution of the puzzle with the identity element. 
Applying a move to a configuration is multiplication by a generator from the 
right. Below we describe the group and the generators that we will use. 

It is a well-known fact of group theory (seen most easily by observing how 
the transformations act on the main diagonals) that the rotation group of 
the cube is isomorphic to S4, the symmetric group of degree 4. Thus we can 
represent any configuration (maybe not reachable by legal moves) of the puzzle 
by a matrix T = (gy) £ S™ xn , where S™ xn is itself a group, being the direct 
product of mn copies of S4. If we fix a way we represent rotations of a cube 
by S4 then there are elements u, d, I, r G S4 such that turning a single cube 
up (resp. down/left /right) corresponds to multiplication (from the right) by u 
(resp. d/l/r). Using cycle notation, one may choose for example u = (1423), 
d= (1324),!= (1342), r = (1243). Note that all of them are odd permutations. 
Then up^ is the transformation that replaces gij by • u for j = 1 , . . . , n, in 
other words, up { is (coordinate-wise) multiplication in S™ xn by an element 
that has n entries of u in the ith row and the identity element in every other 
position, and similarly for the other moves. Thus we may identify up^ and the 
other moves by some elements in S™ xn . The set of moves in this group will be 
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denoted by M. So M = {up i; dowrn | 1 < i < m} U {leftj , right j | 1 < j < n} 
We may now formulate several questions. 

• Reachability. What are the states from which the solution is reachable 
using legal moves, or formally: what is the subgroup H of S™ xn gener- 
ated by M? We are also interested in the order of this subgroup and 
how membership in this subgroup (solvability of a configuration) can 
be decided. Note that reachability between configurations is symmetric 
since every sequence of moves can be executed " backwards" . 

• Solution. Given an element in H, what is a sequence of moves that 
takes it to the identity (solve the puzzle). We are interested in human- 
executable ("easy") and computer-aided ("fast") techniques as well. What 
is the shortest sequence of moves, in other words, what is the shortest 
product of moves that gives h G H? 

• Diameter. What is the maximum length, taken over elements h 6 H, 
of shortest sequences of moves taking h to the identity? In other words, 
how many moves are required in the worst case for solving the puzzle? 

These questions can be analized by using Cayley graphs that are defined as 
follows. 

Definition 1 Let H be a group generated by M. The Cayley graph for H and 
M has H as the set of vertices and there is a directed edge from hi to Y12 iff 
for some m G M, him = h.2. 

Solving the puzzle is the same as finding a path to the identity; and the 
diameter of this graph is exactly the length of the longest of all shortest paths 
from some h to the identity in the Cayley graph. We return to these questions 
in the next section. 

2.1 Generalization for arbitrary finite groups 

The mathematical formulation above allows us to generalize the problem for 
arbitrary finite groups. Let G be a finite group and let R, C C G (row and 
column moves, respectively). Let n, m be positive integers and consider the 
group G mxn with pointwise multiplication. For each re R and 1 < i < m 
let T{ e G mxn De a matrix which has n entries equal to r in the ith row and 
the identity element e € G in other rows. (Here V stands for row, not to be 
confused with the r for the original S4 puzzle, where it is short for right, and 
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is a column move.) For each c G C and 1 < j < n let c, 6 G mxn be a matrix 
which has m entries c in the jth column and the identity element e G G in 
other columns: 




/ e e 
e e 
e e 



e e 
e e 
\ e e 



e c e 
e c e 
e c e 



e c e 
e c e 
e c e 



e 
e 



e 
e 



Definition 2 Given G, R, C, m and n as above, the set of row moves is 
{V{ | r G R, 1 < i < m}, i/te set of column moves is {q | c € C, 1 < j < n}. 
The set of moves is their union M = M(G, R, C, m, n) = {r^ | r G R, 1 < i < 
m} U {Cj | c G C, 1 < j < n}. 

Definition 3 Given G, R, C ; m and n as above, the set of reachable config- 
urations is H = H(G,R,C,m,n) = (M) < G mxn . 



We can ask what the subgroup H is, how we can give a product of moves for 
an h. G H and what the diameter of the Cay ley graph for H and M is. These 
questions seem hard to answer in general, so we only focus on a few special 
cases that will be useful in the S4 case which is a special case with G = S4, 
R = {up, down}, C = {left, right}. 



2.2 Abelian groups 

If G is Abelian, then so is G mxn , meaning that the order in which we perform 
the moves has no effect on their product. Therefore, instead of sequences of 
moves, we can speak of sets of moves. Denote by 04 (resp. bj) the product 
of moves performed on the ith row (resp. jth column). Note that at is not 
necessarily an element in R. Then the product of all the moves performed has 
the matrix form T = ( gij ) where gtj = atbj. We claim that the first row and 
the first column determine all other elements in T. To see this, let i, j > 1 , 
then gtj = a^bj = ((nbjHaib! ){a^ T 1 = gijgiig^ 1 . 
Consider the special case when R = C = G. 
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Theorem 4 If G is Abelian, R = C = G and n, m G N + , then the reachable 
configurations are H = {(ciibj)i = i ) ... )mj j = i ) ... )n | a, 6 G,bj G G}, and uie Ziaue 
|H| = \G\ m+n ~\ 

Proof. For any configuration in G mxn , we can solve the first row by using 
column operations, then solve the remaining elements of the first column by 
row operations. If the rest is not solved at this point, then the configuration 
is not in H, because the remaining elements of the configuration are uniquely 
determined by the first row and the first column for elements of H. Thus H 
has configurations of the form (aibj)^...^^...^ if we let e.g. cu. = gn and 
bj = gij/gii- It is also clear that H contains every element of this form. The 
number of required moves to solve the puzzle is m + n — 1 in the worst case. 
The order of H is |G| m+n_1 , because that is the number of ways we can adjust 
the elements in the first row and the first column. □ 

If we weaken the conditions but require that both R and C generate G , then 
the solution method can be the same, but for solving the first row and column, 
a sequence of moves is required for each element. 

We note that at least one special case of this Abelian version is part of 
mathematical folklore: when the group is the two-element group and the only 
moves are multiplication by the generator. This is often told with coins (and 
sometimes with lamps) arranged in a matrix form and allowed moves being 
the simultaneous turning over of entire rows or columns. The question is how 
we can tell if an initial configuration can be transformed into the "all heads" 
configuration. 

The discussion gets more complicated when at least one of R and C generates 
only a nontrivial subgroup of G, but since we do not need this for the cube 
puzzle case, we omit the analysis of this case for brevity. 

2.3 Simple groups 

Another special case is when G is a non-abelian simple group. We investigate 
this case because the method used for solving it - the method of commutators 
- is also useful for the cube puzzle. 

Definition 5 Let G be a group, gi,g2 & G. The commutator of gi and g2 is 
t9b 92] = 9'\929^92 ] ■ This is the identity element if and only if gi and gi 
commute. Note that in some sources in the literature, the inverses come first 
in the definition. 

Lemma 6 Let G be a group, R, C C G and m, n as above, r £ R, c G C, 
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Figure 2: The figure illustrates how a commutator of a row move and a column 
move modifies only one entry. The sequence of moves upi , lefti , downi , righti 
cancels for all entries except at position 1,1. 

1 < t < m, 1 < j < n. Then multiplying a configuration by [ft, Cj] only affects 
the entry at position i, j . 

Proof. Denote the entries of the configuration by gi'j'. The transformation 
t = [r^, Cj] = TiCjr^Cj -1 can only alter elements in the ith row and the jth 
column. But if j' ^ j, then Cj does not modify the elements in column j', hence 
the effect of t on gy/ is multiplication by r^rr 1 = e, the identity. Similarly, g^/j 
is left intact by t if i' ^ i. Thus, the only element that can be affected is gy. 
The effect on this element is multiplication by [r, c] = rcr -1 c _1 . See Figure 2 
for an illustration. □ 

Using this method we can individually modify the elements of (gtj) by com- 
mutators of the form [r, c] or [c,r]. More generally, using sequences of moves 
we can individually modify gy by [r, c] or [c, r] for r G (R), c G (C). Hence we 
have the following result. 

Theorem 7 Let G be a finite non-abelian simple group, R = C = G, m, n G 
N + . Then H = H(G, R, C,m, n) = G mxn , in other words, every configuration 
is solvable. 

Proof. The commutator subgroup of G, denoted by G' is the subgroup gen- 
erated by all commutators [gi,g2l for gi,g2 G G. By the above lemma, we 
can alter any individual entry of a configuration by any element in G'. But G' 
is always normal in G, hence if G is a finite simple non-abelian group, then 
G' = G. Therefore any configuration can be solved, by individually solving all 
entries. □ 

Note that the theorem also holds if we only assume (R) = (C) = G - note 
that for r G R, r _1 G (R) by the finiteness of G. Also note that for Abelian 
groups, the commutator method is useless, since we have G' = {e}. 

If G is a finite simple non-abelian group and (R) = (C) = G, then the 
diameter of the Cayley graph can be bounded both from below and above for 
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general m and n using constants that depend only on G, R, C. The number of 
vertices of the graph is |G| mn , the graph is (m|R| + n|C|)-regular. From this we 
get the lower bound log(|G| mn )/log(m|R| +n|C|) — 1 = constimn/log(m|R| + 
n|C|). For the upper bound, note that we can solve each configuration in const2 
moves. Thus the number of moves required is at most const2 • mn: 

const i • mn „ ,„ , ,,, . . 

< diam(Cayley(H, Mj) < const2 • mn. (1) 



log(m|R| + n|C|) 



The gap between the lower and upper bounds is logarithmic in m and n. It 
would be interesting to reduce this gap. We conjecture that the diameter is 
closer to the upper bound. 



3 Local solution method for the cube puzzle 

We return to the analysis of the original cube puzzle, where G = S4 and 
the row moves are turning up and down (R = {u, d}), the column moves are 
turning left and right (C = {I, r}). The subgroup in S™ xn generated by all 
moves is again denoted by H. In this section we provide a decision method for 
solvability and present a method for solving using commutators. The method 
is simple and easy to perform for humans. We suppose that some configuration 
s € S™ xn is given and we want to decide if it is in H, and if it is, we want to 
solve it. 

We break the solution of the puzzle into two parts. First, we look for a 
sequence of moves t that transforms s to st G A™ xn , that is, we try to make 
every entry in s into an even permutation. If this is impossible, then s H. If 
it is possible, we will solve the puzzle by using commutators. 

For the first part note that since an up, down, left or right move toggles 
the parity of the affected elements (seen as permutations in S4), the task of 
making every entry an even permutation can be reduced to solving a puzzle for 
the two-element multiplicative group {—1,1} where the legal row and column 
operations are multiplication of the row or column by — 1 . The Abelian analysis 
in the previous section shows that this can be examined by solving the first 
row and the first column and seeing if the rest is solved. If the rest is not 
solved, there is no solution for the puzzle. If there is a solution, it is found in 
at most m+n — 1 steps. To perform the first part, the player has to remember 
which rotations correspond to even and odd permutations, but this is not too 
hard. 

In the second part, we individually rotate all mn cubes to their solved 
positions, using the method of commutators. One readily verifies that commu- 
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tators of the form [u.;., [ui,Tj] etc. (altogether 8 types for a fixed pair i, j) 
are sufficient to transform any element of A4 to the identity in at most 8 steps 
(two commutators suffice). This part then requires a worst case 8mn moves. 
The overall number of steps needed to solve any solvable configuration using 
this local method is thus at most m + n — 1 + 8mn. 

We summarize the above discussion in the following theorem. 

Theorem 8 Let m,n € N + . Then |H| = |H(S 4 , R, C, m, n)| = 2 m+n_1 ■ 12 mn . 
The diameter of the Cayley graph of H and M is at most 8mn + m + n — 1 . 

4 Solution method using subgroups 

Another method borrowing ideas from the Rubik's Cube literature [5, 6] is 
the method of subgroups generated by restricted moves. The method of the 
previous section can be considered as a special case of this method, but in 
general, the method is not intended for human execution. The method is most 
useful when the search space for finding the overall shortest path in the Cayley 
graph is too large, but a computer-aided search using subgroups finds shorter 
paths than the commutator-based method. 

Take any nontrivial subgroup Hi of H, in practice a subgroup with compa- 
rable index and order is preferable. Let the input for the method be an h G H, 
and we proceed in two parts. First find the shortest sequence of moves that 
takes h. into Hi , then solve the problem for Hi . One can also use a chain of 
subgroups, but we only consider the case of one subgroup. The subgroups of 
interest here can admit the following form. Let K be a subset of {1 , . . . , m}. 
We allow transformations generated by all possible moves with the restriction 
that for rows with index i G K, only double moves u? = d? are allowed (in 
the half-turn metric this is extremely useful, since these moves count as one 
move). This method can be used to find a shorter move sequence than the 
naive method that is not necessarily optimal. The optimal choice of K is a 
nontrivial issue, in practice, computer experiments can be used to choose a 
good size for K. 

5 Computational results 

For small values of m and n we can use a computer to determine the diameter 
or other properties of the Cayley graph in our problem. The number of pos- 
sible configurations grows exponentially in nm, so even for moderate values, 
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Size 


Diameter 


Average 


Median 


Local est. 


Nr. of conf. 


1 x 1 


4 


2.17 


2 


9 


24 


2 x 1 


7 


4.44 


5 


18 


576 


3 x 1 


10 


6.59 


7 


27 


13824 


4 x 1 


12 


8.59 


9 


36 


331776 


2x2 


12 


7.82 


8 


35 


16588 


5 x 1 


15 


10.69 


11 


45 


7962624 


6 x 1 


17 


12.65 


13 


54 


191102976 


3x2 


14 


10.54 


11 


52 


47775744 



Table 1: This table lists the maximal/average/median number of optimal 
moves needed to solve a puzzle of the given size. It also lists the number of 
moves that the local method from Section 3 takes and the number of reachable 
configurations. 



Distance 





1 


2 


3 


4 


Number of points 


1 


10 


69 


456 


2846 


Distance 


5 


6 


7 


8 


9 


Number of points 


16208 


84428 


395566 


1622641 


5536264 


Distance 


10 


11 


12 


13 


14 


Number of points 


13587945 


17558644 


8100138 


843444 


27084 



Table 2: The number of reachable configurations that are 0, 1, . . . , 14 moves 
away in the 3x2 puzzle. 



we quickly run into memory storage problems. We represented configurations 
in a compact form using base 24 integer numbers. For the determination of 
the diameter and average distances, a breadth-search is performed, where the 
neighbors of a configuration are calculated using pre-stored 24-element rota- 
tion arrays - we listed in advance how the moves transform cube positions. 

The tables summarize the results. Table 1 lists the number of reachable 
states, the average and median distance and the maximal distance for vari- 
ous puzzle sizes. Table 2 has details about the 3x2 case, listing how many 
configurations can be found on individual levels of the tree. 
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6 Summary and further directions 

We gave a model for the picture cube puzzle which allowed us to answer 
some naturally arizing combinatorial questions. Mathematically, the study of 
general G,R, C and the asymptotic analysis of the diameter is the planned 
continuation of the present work. From a computational point of view, the 
further investigation of the exact values of the diameter for small m, n in the 
cube puzzle is our future plan. 

Replacing the 2-dimensional configuration by higher dimensional ones is also 
a possible extension. 
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